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ABSTRACT: 

A data service system in a data access network system is described. The 
data service system includes a content sen/er that stores content files for 
access by extemal access requests. Each of the content files is stored in a 
full content format and an adapted or degraded content fornnat which is less 
resource-intensive to serve than the full content format. The data service 
system also includes an adaptive load control system which is coupled to 
the content server to pass the access requests to the content server. The 
adaptive load control system modifies an access request to access the 
corresponding content file in the adapted content format when the content 
server is in an overload condition such that the content server can be 
maintained at safe load conditions. A method of maintaining the content 
server at safe load conditions is also described. 
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(54) Web server with an adaptive load control 



(57) A data service system in a data access network 
system is described. The data service system includes 
a content server that stores content files for access by 
external access requests. Each of the content files is 
stored in a full content format and an adapted or degrad- 
ed content format which is less resource-intensive to 
serve than the full content format. The data service sys- 
tem also includes an adaptive load control system which 



is coupled to the content server to pass the access re- 
quests to the content server. The adaptive load control 
system modifies an access request to access the cor- 
responding content file in the adapted content format 
when the content server is in an overload conditk>n such 
that the content server can be maintained at safe load 
conditions. A method of maintaining the content server 
at safe load conditions is also described. 
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Description 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

[0001] The present invention pertains to Internet/In- 
tranet systems. More particularly, this Invention relates 
to an adaptive web server that serves user access re- 
quests for the web server to less resource intensive con- 
tents in the web server when the web server is overload- 
ed. 

2. Description of the Related Art 

[0002] An example of a data access network system 
is the Internet or Intranet network system. An Internet/ 
Intranet network system typically includes a number of 
data service systems connected together via intercon- 
nect networks. The data service systems typk^lly in- 
clude web and/or content servers that host contents for 
various customers or applrcations. The customers are 
the owners of the contents hosted in the data service 
systems such that subscribers or users can access the 
contents via their computer terminals. The content serv- 
ers typically utilize Internet applications, such as elec- 
tronic mail, bulletin boards, news groups, and World 
Wide Web access. The hosted contents are arranged in 
the form of content sites within the content servers. Each 
site may include a number of pages (e.g., world wide 
web pages). A content site is typically for one customer 
of the server while a partk:ular customer may own a 
number of content sites. The content servers may also 
be referred to as web servers. 

[0003] Often, a web page is formed from a number of 
data and/or executable program files, such as text files, 
graphics files, audio files, video files, and applbatbn 
program files. Each of the files is referred to as an "ob- 
ject". The web pages can be accessed by a user at a 
user terminal (e.g., a personal computer system or a 
web access devk;e) connected to any one of the data 
service systems. 

[0004] As is known, access to the web pages via In- 
ternet is typically stmctured around the HTTP (Hyper 
Text Transfer Protocol) protocol. The HTTP protcx:ol is 
a request-and- response protocol. When a user at a cli- 
ent device designates a particular web page, at least 
one request is generated. The number of requests is de- 
pendent upon the sophistication of the designated web 
page. As described above, a web page may include one 
or more "objects". A multi-object page is aesthetically 
pleasing, but each object requires a separate request 
and a separate response. Therefore, the time for each 
request-and-response round trip plays a role in deter- 
mining the total time a user must wait to view the com- 
plete web page. 

[0005] A web server can be accessed by multiple us- 
ers at the same time. The web server typrcally handles 



the user access requests in the first-come-first-served 
(FIFO) fashion. One disadvantage of this type of prior 
art web server structure is that the web server Is not 
equipped with protection mechanism against excessive 

s load conditions. As is known, a web server typk:ally has 
limits on the number of requests it can handle per sec- 
ond. In addition, the web server also has limits on the 
response time and bandwidth that can be served. Under 
high bads, the web sen/er can be overwhelmed with re- 

10 quests, resulting in bnger response time and poor per- 
formance for users. Thus, when the number of user ac- 
cess requests to the web server greatly exceeds the 
processing capacity of the web server (i.e. , overloaded), 
the web server either handles a request in an unbeara- 

15 bty long period of time or simply becorhes non respon- 
sive. The sharp increase in server response time may 
also cause user connections to time out, creating the 
perception that the server is dovwi or crashed. In mis- 
sk)n-critical applications, such as e-commerce, per- 

20 ceived sen/er downtime may result in loss of sales and/ 
or other financial losses. 

SUMMARY OF THE INVENTION 

25 [0006] One feature of the present invention is to im- 
prove performance of a web server 
[0007] Another feature of the present invention is to 
increase performance of a web server by sending user 
access requests for the web server from less resource 

30 intensive contents in the web server when the web serv- 
er is to be overloaded. 

[0008] A further feature of the present invention is to 
provide an adaptive web server that, when in overload 
situation, routes the user access requests to the less 
35 resource intensive contents in the web server such that 
the web server does not become overloaded or non- 
functional. 

[0009] A data service system in a data access net- 
work system includes a content server that stores con- 

40 tent files for access by extemal access requests. Each 
of the content tiles is stored in a full content format and 
any number of adapted or degraded content formats 
which are less resource-intensive to serve than the full 
content format. The data service system also includes 

4S an adaptive load control system which is coupled to the 
content server to pass the access requests to the con- 
tent server. The adaptive load control system modifies 
an access request to access the corresponding content 
file in one of the adapted content formats when the con- 

50 tent server is in an overload condition such that the con- 
tent server can be maintained at safe load conditions. 
[001 0] In a data sen/ice system of a data access net- 
work system having a content server that stores content 
flies for access by external access requests, a method 

55 of maintaining the content server at safe load conditions 
includes the step of determining load condition of the 
content sender when the data sen^ice system receives 
an access request to access one of the content files 
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stored in the content server. This step can be performed 
internally in the content sender or external to the content 
server. If the content server is determined to be in an 
overload condition, then the access request is modified 
to access the corresponding content file In an adapted s 
content format which is less resource-intensive to serve 
than the same file in a full content format such that the 
content server is maintained at the safe load conditions. 
[0011] Other features and advantages of the present 
Invention will become apparent from the following de- io 
tailed description, taken in conjunction with the accom- 
panying drawings, illustrating by way of example the 
principles of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS is 

[001 2] Figure 1 shows a data access network system 
that includes a number of data service systems. 
[0013] Figure 2 shows the structure of one of the 
adaptive data service systems of Figure 1 in accordance ^ 
with one embodiment of the present invention. 
[0014] Figure 3 shows the control and feedback loop 
of the adaptive data sen^lce system of Figure 2 for partial 
degradation. 

[0015] Figure 4 schematically shows the structure of 2S 
the content sender of the adaptive data service system 
of Figure 2. 

[001 6] Figures 5 and 6 show various comparisons be- 
tween the adaptive data service system of Figure 2 and 
a prior art data service system. 30 
[001 7] Figure 7 shows a Gap Estimation method em- 
ployed by a load monitor of Figures 2-3. 

DETAILED DESCRIPTION OF THE INVENTION 

3S 

[001 8] Figure 1 shows a data access network system 
10. In one embodiment, the network system 10 Is an 
Internet system. In another embodiment, the network 
system 10 is an Intranet system. Alternatively, the data 
access network system 1 0 may be any other known net- 40 
work system that empk>ys a known communication pro- 
tocol. 

[001 9] As can be seen from Figure 1 , the data service 
system 20 is connected to a number of user terminals 
11a through 1 1 n via an interconnect network 1 2. Users 45 
at each of the user terminals 11a-11n can access the 
data sen^ice system 20 for the services provided by the 
data service system 20. The users at each of the user 
terminals 11 a-1 1 n can also access the data service sys- 
tem 1 5 for the services provided by the data service sys- so 
tern 15 via the data sen/rce system 20 and the Internet 
13. The user terminals 11 a-1 In can also be referred to 
as client devices. 

[0020] Each of the user terminals 1 1a-11 n may be lo- 
cated at a reskJence, a school, or an office of the user ss 
Each of the user temriinals 11a-11n includes a network 
access application program (e.g., a web browser appli- 
cation program such as Netscape's Navigator) that al- 



lows the user to access the data services offered by the 
data sen/ice systems 1 5 and 20. 
[0021] Each of the user terminals 11 a-1 In can be a 
computer system or other electronk; device with data 
processing capabilities (e.g., a web TV). The intercon- 
nect network 12 can be any known network, such as 
Ethernet. ISDN (Integrated Sen/ices Digital Network). T- 
1 or T-3 link. FDD) (Fiber Distributed Data Network), ca- 
ble or wireless LMDS network or telephone line network. 

[0022] Figure 1 shows only two data service systems 
15 and 20 tor the data access network system 10 for 
illustration purposes only. In practice, the data access 
network system 1 0 may include many more data servrce 
systems. In addition, the Internet 13 is formed by a 
number of data service systems connected together via 
a network. Data communications among all data service 
systems are conducted using a predetermined commu- 
nk^ation protocol for Internet/Intranet communications. 
In one emt>odiment, the communication protocol is the 
Hyper Text Transport Protocol (HTTP). Alternatively, 
other known communication protocols for Internet/In- 
tranet communications can also be used. 
[0023] Each of the data service systems 15 and 20 
has substantially the same functional structure, which 
will be described in more detail below. Each of the data 
service systems 1 5 and 20 can be employed by an In- 
ternet/Intranet Service Provider (ISP) to offer data serv- 
k;es (e.g.. web, news, or advertisement) and other serv- 
k:es (e.g., e-commerce, e-mail) to users or subscribers 
connected to the data access system 10 via one of the 
data service systems 1 5 and 20. 
[0024] Each of the data service systems 15 and 20 
includes a number of servers (e.g.. web senders, e-mail 
servers, news servers, e-commerce servers, domain 
name servers, address assignment servers, proxy serv- 
ers, advertisement servers, and session manager serv- 
ers). The web servers, e-mail servers, news servers, e- 
commerce servers, and advertisement servers can be 
collectively referred to as local servk:e servers or con- 
tent servers. A content server typbally stores a number 
of content files that include Hyper-Text Markup Lan- 
guage (HTML) web pages, gif images, video clips, etc. 
Data transfers to and from the content servers are ena- 
bled by transport protocols such as Transport Control 
Protocol (TCP) and the User Datagram Protocol (UDP). 
The content servers support a variety of Internet appli- 
cations to provide services such as access to the World 
Wide Web, electronic mail, bulletin boards, chat rooms, 
news groups, and e-commerce. Using a currently com- 
mercially available web browser and other client appli- 
cations, users via their respective user terminals can ac- 
cess the content files (e.g., web pages, news, images, 
e-mails,) stored in the content servers. 
[0025] As described above, the access to the content 
servers is structured around the HTTP protocol which is 
a request-and-response protocol. When a user at a user 
terminal desires to access a content file stored in a con- 
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tent server of one of the data sen/ice systems 15 and 
20. at least one request is generated and sent to the 
content server. The content sender can handle multiple 
requests at the same time. However, the content server 
has limits on the number of requests it can handle per 
second. When the number of requests received by the 
content server greatly exceeds the limits of the content 
server, the content server will be In an overload condi- 
tion, resulting in at least unbearably longer response 
time and poor performance for users. This means when 
the request rate increases beyond the server capacity, 
server performance deteriorates dramatically, potential- 
ly causing a service outage. In addition, when a client 
issues a request and does not get any response, the 
client is likely to issue several more subsequent re- 
quests. This quickly increases the number of requests 
received by the content server. To resolve the problem, 
each of the data service systems 1 5 and 20, in accord- 
ance with one embodiment of the present invention, in- 
cludes an adaptive toad control mechanism to control 
the toad condition of its content servers such that each 
content server does not become overloaded. Alterna- 
tively, the load control mechanism of the present inven- 
tion can also be applied to other servers of each of the 
data service systems 15 and 20. The adaptive load con- 
trol mechanism is described in more detail betow, also 
in conjunctton with Figures 2-6. 
[0026] As can be seen from Figure 2, a data service 
system 30 in accordance with one embodiment of the 
present invention is shown to include a content server 
31 . The data sen/ice system 30 also includes an adap- 
tive load control system 40 that controls the load condi- 
tion of the content server 31 in accordance with one em- 
bodiment of the present invention. Thus, the data serv- 
ice system 30 can also be referred to as adaptive data 
service system or adaptive web server. The data service 
system 30 can be any one of the data sen/toe systems 
(e.g., the data sen^lce system 15 or 20) of Figure 1. 
[0027] As will be described in more detail below, the 
content server 31 stores content files or dynamic exe- 
cutable code/program for access by access requests. 
Thus, the content files hereinafter refer to (1 ) static con- 
tent files, (2) dynamic content files, and (3) executable 
programs/codes. Each of the content files is stored in a 
full content format and an adapted or degraded content 
format. A content file in the adapted content format Is 
much smaller in size and less resource Intensive than 
the same file in the full content format. The adaptive load 
control system 40 passes the access requests to the 
content server 31 when receiving the requests. When 
the content sender 31 is in an overload condition or about 
to be in the overload condition, the adaptive load control 
system 40 modifies the access requests to access their 
corresponding content files in the adapted content for- 
mat. When the adaptive load control system 40 deter- 
mines that the content sender 31 is not in the overload 
conditton, the adaptive load control system 40 modifies 
the access requests to access their corresponding con- 
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tent files in the full content format. This allows the con- 
tent server 31 to be maintained at safe load conditions. 
Alternatively, the adaptive load control system 40 can 
adapt the content sen/er 31 based not on the toad con- 

s dition of the content server 31 , but on the client's abilities 
(e.g., network connection of the client device, latency, 
or display capability of the client device). The data sen^- 
ice system 30 will be described In more detail betow, 
also in connection with Figures 2-6. 

10 [0028] The data sewtoe system 30 can be ffnplemenl- 
ed in a computer system or other data processing sys- 
tem. The computer system that implements the data 
service system 30 can be server computer system, a 
workstation computer system, a personal computer sys- 

is tem, or a mainframe computer system, a notebook com- 
puter system, or any other computer system. 
[0029] The data sen/toe system 30 includes a network 
interface 35 that interfaces the adaptive load control 
system 40. The adaptive toad control system 40 then 

20 interfaces with the content server 31 . The network inter- 
face 35 serves as the interface of the data servtoe sys- 
tem 30 with external network (not shown). The interface 
35 can be implemented by any known network interface 
technology, and thus will not be described in more detail 

2S below. The network Interface 35 receives external re- 
quests for the content sender 31 and passes the re- 
quests to the content server 31 via the adaptive load 
control system 40. 

[0030] The content server 31 can be any kind of con- 

30 tent server that stores a number of content files. Each 
of the content files can be accessed by an access re- 
quest. The content server 31 may also include a number 
of content sites, each storing a number of content files 
for access by multiple access requests. The multiple 

3S content sites may belong to different content providers 
or customers with potential confltoting interests. In this 
case, the adaptive load control system 40 causes the 
content server 31 to be a number of performance-iso- 
lated virtual content servers, which will be described in 

40 more detail below. Each of the servers has a capability 
to provide independent performance guarantees re- 
gardless of the toad on the other virtual servers. 
[0031] The content server 31 can be a static server or 
dynamic server. In one embodiment, the content server 

4S 31 is a static content server that stores static content 
files only. In another embodiment, the content server 31 
may store both statb and dynamic content files. As is 
known, web content is generally classified as static, 
such as a file, or dynamic, such as cgi-scripts. Dynamic 

50 content may be generated at run-time by a back-end en- 
gine (e.g., a database engine) that is separate from the 
server itself. Any adaptton at the back-end engine is not 
handled by the adaptive load control system 40. 
[0032] In accordance with one embodiment of the 

55 present inventton, each of the content files stored In the 
content server 31 is stored in multiple formats or ver- 
sions. For example, a content file can be stored in Its 
original full content format or vers ton. The same content 
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file is also stored in a degraded or adapted format or 
version in the content server 31 . Figure 4 shows an ex- 
ample of such arrangement. As can be seen from Figure 
4, the content file in the degraded format 72 Is smaller 
in size and worse in viewing quality than the same file 
in the full content format or version 71 . This means that 
the degraded format of the content file requires less re- 
source (e.g., bandwidth) to serve than the counterpart 
full content format of the same content file. This is pos- 
sible because a content file can be shown, for example, 
at 64X64 pixel resolution, at 128X128 pixel resolution, 
at 256X256 pixel resolution, or at higher or lower reso- 
lutions. The 256X256 pixel resolution version of a con- 
tent file clearly has more image data and requires more 
resource to transfer than the 128X128 pixel resolution 
version of the same content file. Thus, if the full content 
version of a content file is at the 256X256 pixel resolu- 
tion, the degraded version of the same content file can 
be at 128X128 pixel resolution, or at 64X64 pixel reso- 
lution. In summary, the degraded format of a content file 
may include images of less resolution, fewer embedded 
images, simplified or simpler pages without back- 
grounds or fewer hyper-links than the full content format 
of the same content file. Alternatively, each of the con- 
tent files stored in the content server 31 has several de- 
graded versions of the same content file. The content 
server 31 will be described in more detail below with only 
two versions (i.e., the full content version and one de- 
graded versran), for example, for each of the content 
files stored in the content server 31 for illustratk)n pur- 
poses only. 

[0033] Referring again to Figure 2, each access re- 
quest specifies a content file stored in the content server 
31 using the Universal Resource Locator (URL). The 
URL path for a content file stored in the content server 
31. however, does not specify which version or format 
from whrch the content file will be accessed. This means 
that multiple content trees contain different versions of 
the same URL. The path to a particular URL in a given 
content tree will be the concatenation of the content tree 
name and the URL name, prefixed by the name of the 
root sen/ice directory of the content server 31. For ex- 
ample, two content trees may be created in the root 
sen^ice directory Vroot", one being VfulLcontenf and 
the other ■/degraded_content". The "/fulLcontenf con- 
tent tree name directs the request to access the corre- 
sponding content file in the full content versbn or format. 
The 7degraded_contenf content tree name directs the 
request to access the corresponding content file in the 
degraded content version or format. In this case, a URL 
of ■http://www.hpl.hp.com/my_pk;ture.jpg" may be 
sensed from either the directory Vroot/fuILcontent/ 
my_picture.jpg', or from the directory "root/ 
degraded_content/my_pk:ture.jpg'. 
[0034] The scheme applies even if the content file is 
dynamic content, such as the content generated by cgi- 
scripts. Multiple content trees may contain different ver- 
sions or formats of the named cgi-scripts (e.g., 



■my_script.cgi*). The Vdegraded_content/' version or 
format is a less resource-intensive search script that, for 
example, looks only for the first five matches, instead of 
the "/fulLcontent" version or format that looks for one 

5 hundred matches. The URL of the script is then prefixed 
by the right tree name to determine which version of the 
script to execute under given load conditions. For ex- 
ample, the URL of ■httpy/www.hpl.hp.corWmy_script. 
cgi" may be modified to become either "http://www.hpl. 

10 hp.com/root/full_content/cgi-bin/my_script.cgi" or "http: 
//www.hpt.hp.com/root/degraded_content/cgi-bin/ 
my_script.cgi". 

[0035] Alternatively, the less resource-intensive "/ 
degraded_contenf search script can be replaq^d or 

IS substituted with a static version or fomiat. This can be 
done by switching to a different content tree. For exam- 
ple, an on-line vendor using the content sender 31 can 
use a dynamically generated version of their product 
catalog that interacts with a stock database to display 

20 the items currently in stock. When the content server 31 
is overbaded, its statically pre-stored catabg version Is 
then accessed and displayed. The reason for doing this 
is that a dynamic script, even degraded or simplified, 
still consumes more resources to serve than the static 

2S content. 

[0036] The adaptive k)ad control system 40 is con- 
nected to the content server 31 . In accordance with one 
embodiment of the present invention, the adaptive load 
control system 40 detects overload condition of the con- 

30 tent server 31 and causes the access requests to ac- 
cess the corresponding file from one of several alterna- 
tive content qualities for delivery In accordance with the 
load conditions. When the adaptive load control system 
40 detects that the content server 31 is in the overload 

35 conditions, the adaptive load control system 40 causes 
the incoming access requests to the content server 31 
to access the corresponding content files in the degrad- 
ed format which is less resource-intensive to serve. The 
degraded format reduces the size of responses. The de- 

40 graded format also reduces the total number of requests 
as the simplified content includes fewer embedded im- 
ages and/or other objects. The degraded format also re- 
moves or reduces links embedded in the requested con- 
tent file. Thus, the degraded format is less resource in- 

45 tensive to serve than the full content formal 

[0037] The adaptive toad control system 40 is also 
able to reject access requests if needed. In addition, the 
adaptive load control system 40 supports coexistence 
of multiple performance-isolated virtual servers such 

50 that performance levels and load adaption decisions of 
virtual sen/er do not affect that of another virtual server. 
This is required when the content sender 31 hosts mul- 
tiple content sites that possibly belong to different cus- 
tomers or content providers with potentially conflicting 

56 interests. The total load on the server 31 may not be 
evenly distributed among the hosted sites. In this case, 
the adaptive load control system 40 implements a per- 
formance isolation mechanism that does not penalize a 
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site for excessive load incurred by another site. Using 
this mechanism, the content server 31 is viewed as a 
collection of one or more performance-isolated virtual 
senders, each with a capability to provide independent 
performance guarantees regardless of the load on other 
virtual servers, if any. Guarantees are declared in a serv- 
ice level agreement associated with each virtual server 
and are used to configure the adaptive load control sys- 
tem 40 for appropriate resource capacity allocation. The 
load adaptive control system 40 makes adaption deci- 
sions within each virtual server of the content server 31 
on its load and resource allocation. 
[0038] Moreover, the adaptive load control system 40 
also gives preferential treatment to guaranteed service 
classes, while allowing non-guaranteed service classes 
to consume the excess capability when available. The 
guaranteed sewice classes are governed by service 
level agreements. The non-guaranteed classes are typ- 
ically served on best effort basis with available resourc- 
es. Under overload conditions, the adaptive load control 
system 40 should direct non-guaranteed requests to 
their degraded content files first. An arbitrary policy can 
govern the relative degradation of client classes with dif- 
ferent importance levels. For example, under overload 
conditbns, the class A requests should be sent to the 
degraded content files before class B requests. But if 
resources remain scarce, class B requests are then de- 
graded before rejecting class A requests. 
[0039] The adaptive load control system 40 can be im- 
plemented at any point in the data service system 30 
where the sender's requests and responses can be ac- 
cessed. This means that the adaptive load control sys- 
tem 40 can be implemented in the web server software, 
in the UNIX socket library, or in the operating system of 
a computer system that embodies the data service sys- 
tem 30. Moreover, the adaptive load control system 40 
can also be implemented In the gateway that Is externa! 
to the server, as well as transparent to the server. 
[0040] Basically, the adaptive load control system 40 
requires three entry points. They are (1) initialization 
point, (2) request pre-processing point, and (3) request 
post-processing point. The initialization point initializes 
internally used data structures, and creates a separate 
process that implements various components of the 
adaptive toad control system 40 if desired. The request 
pre-processing point decides the quality of content level 
of the given request. The content adaptor 41 of the 
adaptive toad control system 40 is called from this point. 
The content adaptor 41 may communicate with the other 
components via shared memory. The request post- 
processing point is an opt tonal point which monitors re- 
sponse size. 

[0041] If server code is not available, the adaptive 
load control system 40 can be implemented In a middle- 
ware (e.g., a dynamically linked socket library) used by 
the server 31. In a UNIX environment, as part of server 
initialization, the sender socket is created and a listenQ 
call Is made to the socket library to listen for incoming 



10 

requests. This library routine can be modified to call the 
Initialization point of the middleware. The modification 
can be transparent to the content server 31. The func- 
tional structure of the adaptive load control system 40 
s Is described in nrK>re detail below, also in conjunction 
with Figures 2-7. 

[0042] As can be seen from Figure 2, the adaptive 

load control system 40 includes a request classifier 33 
connected to the network interface 35. The adaptive 

10 load control system 40 also includes a load monitor 32 
connected to the request classifier 33. A content adapter 
41 is then connected to the load nrK>nitor 32 and the con- 
tent server 31. An adaptton controller 50 is connected 
to the content adapter 41 and the load monitor 32. The 

IS components 32-33, 41 , and 50 together form the adap- 
tive toad control system 40. Altematively, the adaptive 
toad control system 40 may include more or fewer com- 
ponents than above mentioned. For example, the adap- 
tive load control system 40 may functton without the re- 

^0 quest classifier 33. 

[0043] The request classifier 33 receives the access 
requests from the network interface 35. The request 
classifier 33 is used to classify the received access re- 
quests into a number of classes based on different clas- 

2S sification metrics. The request classifier 33 altows pref- 
erential treatment to a subset of content sites or clients. 
The request classifier 33 is used in an Installation where 
quality of service differentiation is required among dif- 
ferent classes of clients (e.g., guaranteed and non-guar- 

30 anteed clients, or clients accessing different virtual serv- 
ers). In this situation, the action taken by the content 
adapter 41 later may depend on the Identity of the client 
or the requested content. The request classifier 33 al- 
tows multiple classes of users to share the same content 

3S file or site and yet receive different treatments or per- 
formance. Class-based service is a mechanism for dif- 
ferentiating sen^lces given to individual classes. Thus, 
sennce perfornnance can be priced based on perform- 
ance or sen^ice agreements. A higher class with greater 

40 guarantee can be priced higher than a tower class that 
may offer less guarantee and more 'best effort' sen^ic- 
es. The request classifier 33 can be implemented using 
any known technology. 

[0044] The request classifier 33 then sends the clas- 
^ sified access requests to the load monitor 32. The load 
nrwnitor 32 is used to monitor the load condition of the 
content sender 31 in order to detect overtoad condition. 
The toad monitor 32 employs a number of nnonltoring 
mechanisnrK to mon'rtor the load condition of the content 
50 server 31 . One way of monitoring the load condition of 
the content sender 31 is to monitor the response time of 
the content server 31 , This is because response time Is 
proportional to the length of the Input request queue of 
the content sender 31 (e.g., the server's socket listen 
ss queue in a UNIX implementation). When the server is 
under-loaded, the queue tends, to be short (or empty), 
resulting In short response time. When the content serv- 
er 31 is overloaded, the request queue within the con- 
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tent server 31 overflows, making the response time 
grow an order of magnitude. This serves as a clear over- 
load indicator. The advantage of this approach (i.e., es- 
timating queue length by monitoring response time) is 
that the mechanism can be implemented outside the 
server process and requires no nrxxJification to the code 
of the content server 31 . 

[0045] Another way off monitoring whether the content 
server 31 is overloaded is to nrK>nitor utilization of the 
sender machine, as well as the response time. This al- 
lows the load monitor 32 to determine how under-uti- 
lized the sen/er is. When utilization decreases bebw a 
predetermined level, load monitor 32 may determine the 
content server 31 is no longer in the overload condition. 
[0046] In this case, in order to provide a faithful server 
load estimate, utilization measurement should be indic- 
ative of the load attributed to useful work (i.e., serving 
client requests), rather than the aggregated utilization. 
For example, the existence of a low priority process or 
thread that implements a busy-waiting loop on some 
event will render the utilization identically 100% even in 
the absence of load on the server. Arwjther concern aris- 
es when all server threads or processes have been conv 
mrtted to current requests and are blocked in I/O. The 
resulting idle time cannot be taken advantage of since 
all resources are already utilized and therefore should 
not be measured towards available capacity. 
[0047] To account for the above mentioned concerns, 
two different mechanisms are developed for the load 
monitor 32. The first is referred to as the Linear Approx- 
imation Method. Using this method, the bad monitor 32 
employs a linear function of measured request rate R 
and delivered bandwidth BW to determine the system 
utilization U, consumed by processing client requests. 
The linear function is shown as follows. 

U = aR + bBW 

wherein constants a and b can be computed by either 
on-line or off-line priori profiling as will be described be- 
low. The function is good for estimating offered load as 
long as it does not exceed server capacity. When meas- 
uring in the overload condition, the linear approximation 
method is invalid. Thus, the linear approximation meth- 
od is combined with the response time monitoring to de- 
termine the actual load condition of the server 31. This 
combined load monitoring function means that if the 
measured response time is above the predetermined 
overload threshold, U = 100%. If the measured re- 
sponse time is below the predetenmined overbad 
threshold, U = aR + bBW. 

[0048] One advantage of this linear approximation 
method is that it is easy to implement. Using this meth- 
od, each server process or thread Tj records independ- 
ently its observed request rate Rj, delivered bandwidth 
BWj, and utilization Uj. A separate monitor process or 
thread then reads and sums up the recorded values to 



compute the aggregate request rate R, bandwidth BW, 
and utilizatbn U. Synchronization of access to shared 
data structures is not required since there is only one 
writer to any piece of data. The method allows server 
s capacity planning for quality of servbe guarantees, amj 
it provides means for converting the desired request rate 
and bandwidth guarantee into a corresponding resource 
capacity allocation (I.e., allocated utilization). This meth- 
od, however, requires computing the constants a and b 

10 using the priori profiling. 

[0049] A simple way of computing the constants a and 
b using the priori profiling Is to obtain several measure- 
ments of U and the corresponding R and BW. Then a 
linear regression technique is employed to determine 

IS the constants a and b that best fit the equation U =aR + 
bBW. Note that R and BW can be measured online by 
counting requests seen and bytes delivered within a giv- 
en period. Utilization U can be measured using a Gap 
Estimation method which will be described later. Given 

20 the measured values of R, BW, and U, at successive 
time Intervals, estimation theory provkJes a way to find 
a linear fit that minimizes the error. It provkJes the nec- 
essary formulae for computing and updating the param- 
eters a and b online in view of successive new meas- 

25 urements. 

[0050] Alternatively, the parameters a and b can be 
determined by testing the content sender 31 with a pre- 
specffied workload. Testing can proceed by requesting 
a URL of a given size at an Increasing rate until client 

30 connectbns start I im ing out, indicating server overload. 
The maximum attained request rate and bandwidth are 
then recorded. For our purposes, sen/er utilization can 
be assumed to be 100% at this load condition. The ex- 
periment is repeated for different sizes of the requested 

55 URL, giving a different rale and bandwidth combination 
that saturates the sen/er 31 . The set of R, BW, and U = 
100% points are used to construct the line aR + bBW = 
100 on an R, BW, plane. The line intersects the R and 
BW axes at 100/a and 100/b respectively, from which a 

40 and b are found. 

[0051] The second mechanism for the load monitor 
32 is referred to as the Gap Estimatbn Method. This 
method estimates the fraction of time that the content 
server 31 spends serving requests. Using this method, 

4S ihe load monitor 32 employs a gbbal counter to track 
the requests. The load monitor 32 Increments the coun- 
ter when a request is received and decrements the 
counter when a response departs the content sen/er 31 . 
The counter will return to its initial value only when a gap 

50 Is present (i.e., when all current requests have been 
sen/ed and no additional requests have arrived yet). By 
summing up the gaps over a period of time T, a monitor 
process or thread can compute the total "idle time' G 
within that time period T. The idle time Is the time where 

55 no requests are pending. The utilizatkjn is then estimat- 
ed as U = (T-G)^". 

[0052] The Gap EsltmatkMi method is illustrated in 
Figure 7. As can be seen from Figure 7, concurrent 
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processing of request bursts are separated with idle 
time. The method does not require a prior profiling, and 
does not require monitoring response time. The global 
counter may be updated by muhlple writers and thus re- 
quires some form of locking or access synchronization. 
[0053] Referring back to Figure 2, the load monitor 32 
sends the load condition information of the content serv- 
er 31 to the content adapter 41 and the adaption con- 
troller 50. The adaption controller 50 also receives a pre- 
determined desired load value which indicates the 
threshold of the overload condition. Based on the com- 
parison of the desired load value and the monitored load 
value, the adaptive controller 50 determines whether the 
content server 31 is in the overload condition in which 
case the adaptive controller 50 causes the content 
adapter 41 to modify the URLs of all the incoming ac- 
cess requests to access their corresponding content 
files In the degraded content format or versk)n. If the 
adaptive controller 50 determines that the content serv- 
er 31 is not in the overload conditbn in which case the 
adaptive controller 50 causes the content adapter 41 to 
modify the URLs of the incoming access requests to ac- 
cess their corresponding content files in the full content 
format or versran, 

[0054] The desired load value for the adaption con- 
troller 50 can be a predetermined threshold T of the 
sender response time. This threshold T can be set equal 
to (or slightly smaller than) the maximum sender re- 
sponse time specified in the sen^ice level agreement, if 
any. thereby causing adaption when the agreement is 
about to be violated. In the absence of such specifica- 
tion, the threshold can be derived from the pre-config- 
ured maximum input request queue length Q and the 
average service time S. For example, if we consider a 
90% full queue to be an overload indication, the thresh- 
old T can be set to T = 0.9QXS. 
[0055] The content adapter 41 effects adaption by 
changing directory links. Under the control of the adap- 
tive controller 50, the content adapter 41 modifies the 
URLs of the incoming access requests to access the 
corresponding content files in either the full content for- 
mat or one of the degraded content formats. For exam- 
ple, the content adapter 41 can modify the URL of "http: 
//www.hpl.hp.com/my_picture.jpg' as either "http:// 
www.hpl.hp.com/fulLcontent/my__picture.jpg" to access 
the content file "my.picture.jpg" in the full content for- 
mat, or •http://www.hpl.hp.com/degraded_content/ 
my_picture.jpg' to access the content file "my_picture. 
jpg' in the degraded content format. As another exam- 
ple, when the content is a dynamk: content (e.g., http:// 
www.hpl.hp.com/my_script.cgi), the content adapter 41 
can modify the URL as either "http://www.hpl.hp.com/ 
full_content/cgi-bin/my_script.cgi' to access the full 
script file, or •http://www.hpl.hp.com/degraded_content/ 
cgi-bin/my_scripLcgi' to access the less resource-inten- 
sive script 

[0056] When the adaption controller 50 determines 
that the content server 31 is in the overload conditbn, 



the adaption controller 50 causes the content adapter 
41 to modify the URLs of the incoming access requests 
to access the corresponding content files in the degrad- 
ed or adapted content formal As described above, the 

s degraded or adapted content format requires less re- 
sources to serve than the full content format. When the 
adapt bn controller 50 determines that the content serv- 
er 31 is not in the overload conditran, the adaption con- 
troller 50 causes the content adapter 41 to modify the 

10 URLs of the incoming access requests to access the 
corresponding content files in the full content format. 
[0057] As can be seen f rom Figures 2 and 4, once the 
adaptkxi trigger is fired from the adaptfon controller 50, 
the content adapter 41 switches transparently between 

IS the high quality service tree 73 (Figure 4) and the de- 
graded quality sen/ice tree 74 (Figure 4) in the root serv- 
ice directory 70 (Figure 4) based on the load condition 
of the content server 31 . The switching operation is com- 
pletely transparent to the content sender 31 . 

20 [0058] Figu re 5 shows the result of the adapt ion proc- 
ess by the adaptive load control system 40. The curve 
81 shows the percentage of connectkxi failures by a pri- 
or art sen/er without the adaptive load control system 
40. The curve 80 shows the percentage of connectbn 

25 failures by the content server 31 with the adaptive toad 
control system 40. As can be seen from Figure 5, the 
cun^e 80 shows significantly fewer connection failures 
than the prior art curve 81 as the request rate increases. 
The traditional server suffers an increasing error rate (i. 

30 e., connect ton failures) when offered load exceeds ca- 
pacity of the server 

[0059] As described above, when the adaption con- 
troller 50 determines that the content server 31 is over- 
loaded, the controller 50 causes the content adapter 41 

35 to adapt all incoming access requests to the degraded 
content format. This potentially makes the content serv- 
er 31 under-loaded, thus not achieving adequate re- 
source utilization when the adaption takes place. This 
effect is shown in Figure 6. In Figure 6, the solid line 

40 curve 85 represents the delivered bandwidth of a prior 
art content sender without the adaption mechanism In 
accordance with one embodiment of the present inven- 
tion and the broken line cun^e 86 represents the deliv- 
ered bandwkith of the content server 31 with the adap- 

45 live toad control system 40. As can be seen from Figure 
6, a sharp drop in delivered bandwidth for the curve 86 
is shown for the content server 31 when adaption takes 
place by switching to less resource-intensive content 
format. This leaves server bandwidth underutilized. 

so [0060] This can be overcome by only causing the con- 
tent adapter 41 to degrade a fraction of the incoming 
access requests when the content server 31 is in the 
overload condition. The fraction can be between nothing 
and the all of the incoming access requests. This mech- 

55 anism is implemented in the content adapter 41 . It uses 
an input control parameter G which can be thought of 
as an external control knob that tunes the extent of par- 
tial degradatton. The extremes are G = Max, where no 
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request is degraded, and G = 0, where all incoming re- 
quests are degraded. Thus, decreasing G will decrease 

server load. 

[0061] Assuming the total number of available content 
trees are Max and the content trees are numbered from 
1 to Max in increasing order of quality (tree 0 is a special 
tree that stands for request rejection). Let I be the inte- 
gral part of G and let F be the fractional part. The follow- 
ing rule is used: 

If G is an integer (i.e., G=l, F=0), the content adapter 
41 lets every request be served from the tree I. 
If G is not an integer, the content adapter 41 com- 
putes a pseudo-random number N (in the range 
[0,1]) upon the receipt of each request. If N<F, the 
request is served from tree 1+1. Otherwise it is 
served from tree I. 

[0062] This algorithm provides a continuous partial 
degradation spectrum that ranges between serving alt 
requests from the highest quality content tree and re- 
jecting all requests. Next, a self-regulating technique will 
be described, in conjunction with Figure 3, that uses 
mon itoring and feedback to automatically select the best 
value of G such that the overload is prevented and a 
specified target utilization is maintained. 
[0063] Figure 3 shows the control and feedback loop 
of the adaptive load control system 40 of Figure 2 for 
self -regulating partial degradation in accordance with 
another embodiment of the present invention. In this 
embodiment as shown in Figure 3, a partial degradation 
module 60 is used to control the partial degradation. The 
module 60 includes a integral controller 60 and an adder 
61 . The achieved server utilization from the load monitor 
32 is fed to the adder 61 of the module 60. The desired 
load utilization is also fed to the adder 61 . The end result 
E (i.e., the d'rfference between the two utilizations) from 
the adder 61 is then fed to the integral controller 62. The 
integral controller 62 then adjusts the control parameter 
G to control the content adapter 41 to adjust the number 
of access requests for adaptk>n. Using the feedback, the 
content adapter 41 can quickly and automatically adjust 
the number of degraded requests to be around the tar- 
get or desired utilization. 

[0064] In the foregoing specification, the invention 
has been described with reference to specific embodi- 
ments thereof. It will, however, be evident to those 
skilled in the art that various modifications and changes 
may be made thereto without departing from the broader 
spirit and scope of the invention. The specification and 
drawings are, accordingly, to be regarded in an illustra- 
tive rather than a restrictive sense. 



Claims 



a content server that stores content files for ac- 
cess by external access requests, wherein 
each of the content files is stored in a full con- 
tent format and an adapted content format 
which is less resource-intensive to serve than 
the full content format; 

an adaptive load control system coupled to the 
content server to pass the access requests to 
the content server, wherein the adaptive load 
control system modifies an access request to 
access the corresponding content file in the 
adapted content format when the content serv- 
er is in an overload condition such that the con- 
tent server is maintained at safe load condi- 
tions. 

2. The data servk:e system of claim 1, wherein the 
adaptive load control system modifies the access 
request to access the corresponding content file in 
the full content format when the content server is 
not in the overload condition. 

3. The data service system of claim 1 , wherein the 
adaptive load control system further comprises 

a load monitor that monitors the load condition 
of the content server; 

a content adapter coupled to the load monitor 
and the content server to nrKxiify the access re- 
quest to access the corresponding content file 
in the adapted content format when the load 
rTK)nitor indicates that the content server is in 
the overk)ad condition. 

The data service system of claim 3, wherein the 
adaptive load control system further comprises an 
adaptbn controller coupled to the load monitor and 
the content adapter to cause the content adapter to 
rrKxiify the access request to access the corre- 
sponding content file in the adapted content format 
when the load monitor indicates that the content 
sender is in the overload conditbn. 

The data service system of claim 4, wherein the 
adaption controller determines if the content server 
is in the overload condition by comparing the load 
informal bn received by the load monitor against a 
predetermined desired bad value of the content 
server. 

6. The data service system of claim 3. wherein the 
content adapter nrKxJifies the access request to ac- 
cess the corresponding content file in the full con- 
tent format when the load monitor indicates that the 
55 content sender is not in the overload condition: 
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A data service system in a data servbe network sys- 
tem, comprising: 



The data servbe system of claim 1 , wherein the 
content adapter modifies the access request by 
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modifying the URL (Universal Resource Locator) of 
the access request. 

8. The data service system of claim 1, wherein for 
each of the content files, the content server includes s 
a service directory that directs the modified access 
request to access the corresponding content file in 
either the full content format or the adapted content 
fornDat. 

10 

9. In a data service system of a data access network 
system having a content server that stores content 
files for access by external access requests, a 
method of maintaining the content server at safe 
load conditions, comprising the steps of: is 

determining bad condition of the content server 
when the data service system receives an ac- 
cess request to access one of the content files 
stored in the content server; 20 
if the content server is determined to be in an 
overload condition, then nrKxIifylng the access 
request to access the corresponding content 
file in an adapted content format which is less 
resource-intensive to serve than the same file 25 
in a full content format such that the content 
server is maintained at the safe load conditions. 



formed either within the content server or external 
to the content sender. 



10. The method of claim 9. further comprising the step 
of modifying the access request to access the cor- 30 
responding content file in a full content fonDat when 
the content server is determined not to be in the 
overload condition. 



11. The method of claim 9, wherein the step of deter- 3S 
mining load condition further comprises the steps of 

obtaining the actual load condition of the con- 
tent server using a load monitor; 
comparing the actual load condition with a pre- 
determined desired load condition to determine 
if the content server is in the overload condition. 



12. The method of claim 9, wherein the step of modify- 
ing the access request is performed by modifying 
the URL of the access request. 

13. The method of claim 10. wherein the step of modi- 
fying the access request is performed by modifying 
the URL of the access request. so 

14. The method of claim 9, further comprising the step 
of directing the modified access request to access 
the corresponding content file in either the full con- 
tent format or the adapted content format. ss 

15. The method of claim 9, wherein the step of deter- 
mining load condition of the content server is per- 
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